JOURNAL OF VIROLOGY, Feb. 2005, p. 2001-2009 
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.4.2001—2009.2005 


Vol. 79, No. 4 


Copyright © 2005, American Society for Microbiology. All Rights Reserved. 


Identification of a Novel Coronavirus in Bats 


L. L. M. Poon,'* D. K. W. Chu,’ K. H. Chan,” O. K. Wong,’ T. M. Ellis,? Y. H. C. Leung,' 
S. K. P. Lau,’ P. C. Y. Woo,’ K. Y. Suen,* K. Y. Yuen,! Y. Guan,! 
and J. S. M. Peiris'* 
Department of Microbiology, The University of Hong Kong,’ Department of Microbiology, Queen Mary Hospital,” 


Department of Agriculture, Fisheries, and Conservation,* and Wildlife Conservation Foundation Limited,* 
Hong Kong, Special Administrative Region, People’s Republic of China 


Received 26 July 2004/Accepted 1 October 2004 


Exotic wildlife can act as reservoirs of diseases that are endemic in the area or can be the source of new 
emerging diseases through interspecies transmission. The recent emergence of severe acute respiratory syn- 
drome-associated coronavirus (SARS-CoV) highlights the importance of virus surveillance in wild animals. 
Here, we report the identification of a novel bat coronavirus through surveillance of coronaviruses in wildlife. 
Analyses of the RNA sequence from the ORF1b and S-gene regions indicated that the virus is a group 1 
coronavirus. The virus was detected in fecal and respiratory samples from three bat species (Miniopterus spp.). 
In particular, 63% (12 of 19) of fecal samples from Miniopterus pusillus were positive for the virus. These 
findings suggest that this virus might be commonly circulating in M. pusillus in Hong Kong. 


Coronaviruses are positive-stranded RNA viruses. The viral 
genomes are between 29 and 32 kb long and are packaged in 
enveloped virions with corona-like morphology (5). The viral 
genomes contain five major open reading frames (ORFs) that 
encode the replicase polyproteins (ORFla and ORF lab), the 
spike (S), envelope (E), and membrane (M) glycoproteins, and 
the nucleocapsid protein (N) (5, 10). Of these proteins, the 
replicase polyproteins are directly translated from the viral 
genome. Translation of ORFla-encoding sequence can be ex- 
tended with ORF1b-encoding sequences by a —1 ribosomal 
frameshift to synthesize ORFlab polyprotein (5). By contrast, 
other viral proteins are translated from subgenomic mRNA 
molecules that are synthesized by a discontinuous RNA syn- 
thesis mechanism (14, 22). The ORFla and ORF 1ab polypro- 
teins are nonstructural proteins (6) and are cleaved by papain- 
like cysteine and 3CL proteinases to generate functional units 
for viral transcription and replication (26). S, E, M, and N 
proteins are structural proteins. S is responsible for receptor 
binding (4). E and M are integral membrane proteins and are 
the minimal set of proteins for virus assembly (1). N protein is 
an internal protein, and it binds to viral RNA to form ribonu- 
cleoprotein complex (11). 

The majority of coronaviruses are disease-causing agents 
(16). Human coronaviruses are associated with respiratory and 
gastrointestinal diseases, and animal coronaviruses cause se- 
vere respiratory, enteric, neurological, or hepatic disease in their 
hosts. On the basis of antigenic and genetic analyses, corona- 
viruses are subdivided into three groups (groups 1 to 3) (5, 7). 
Group | viruses include human coronaviruses NL63 (HCoV- 
NL63) and 229E (HCoV-229E), canine coronavirus (CCoV), 
porcine transmissible gastroenteritis virus (TGEV), porcine 
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epidemic diarrhea virus (PEDV), and feline infectious perito- 
nitis virus (FIPV). Group 2 viruses include human coronavirus 
O0C43 (HCoV-OC43), bovine coronavirus (BCoV), and mu- 
rine hepatitis virus (MHV). Group 3 viruses are avian viruses, 
such as avian infectious bronchitis virus (IBV) and turkey coro- 
navirus (TCoV). 

Of the coronaviruses identified hitherto, most were isolat- 
ed from humans, pets, pigs, cattle, or poultry. This bias is 
presumably because viral investigations are often driven by 
disease outbreaks in the above populations. By contrast, inves- 
tigations of wildlife are rare (20, 24), and relatively little is 
known about the prevalence of coronavirus in wild animal 
species (3). The identification of severe acute respiratory 
syndrome-associated coronavirus (SARS-CoV) in civet cats 
and other wild animals in live animal markets suggests that 
this novel human pathogen emerged as a result of an inter- 
species transmission (8). More importantly, these findings high- 
light the potential human health risk posed by coronaviruses in 
wild animals. This has prompted us to launch a survey of 
the prevalence of coronavirus in wild animals in Hong Kong. 
In particular, we were interested in determining whether wild 
animals living in this geographical region carry the precursor 
of SARS-CoV or other unidentified coronaviruses. Here, we 
report the identification of a novel bat coronavirus (BAT- 
CoV). 


MATERIALS AND METHODS 


Sample collection. The animal surveillance program was performed between 
the summer of 2003 and the summer of 2004. The study was approved and 
supported by the Department of Agriculture, Fisheries and Conservation, Hong 
Kong, Special Administrative Region, People’s Republic of China. Small mam- 
malian, avian, and reptile species living in natural reservoirs or country parks in 
Hong Kong were studied. Animals were trapped, and respiratory and fecal swab 
samples were collected. Before samples were taken, all animals were examined 
by a veterinary surgeon and confirmed to be free of overt disease. All captured 
animals were released after samples were taken. Samples were kept in viral 
transport medium (Earle’s balanced salt solution, 0.2% sodium bicarbonate, 
0.5% bovine serum albumin, 200 jg of vancomycin per liter, 18 jg of amikacin 
per liter, 160 U of nystatin per liter) at 4°C. In addition, blood samples were 
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A NOVEL CORONAVIRUS IN BATS — 2003 


TABLE 1. Prevalence of Bat-Cov in animals 


Animal type and 


Common name 


No. of individual No. of positive respiratory No. of positive fecal 


scientific name animals samples samples 
Mammals 
Cynopterus sphinx Short-nosed fruit bat 15 0 0 
Hipposideros armiger Great round-leaf bat 4 0 0 
Hipposideros pomona Bicolored round-leaf bat 3 0 0 
Miniopterus magnater Large bent-winged bat 16 1 2 
Miniopterus pusillus Lesser bent-winged bat 19 5 12 
Miniopterus schreibersii Japanese long-winged bat 4 1 1 
Myotis myotis Large mouse-eared bat 3 0 0 
Myotis ricketti Rickett’s big-footed bat 5 0 0 
Pipistrellus abramus Japanese pipistrelle bat 3 0 0 
Rhinolophus affinis Intermediate horseshoe bat 2 0 0 
Rhinolophus pusillus Least horseshoe bat 1 0 0 
Rhinolophus rouxi Rufous horseshoe bat 6 0 0 
Canis familiaris Feral dog 6 0 0 
Felis catus Feral cat 1 0 0 
Herpestes javanicus Javan mongoose 1 0 0 
Herpestes urva Crab-eating mongoose 1 0 0 
Hystrix hodgsoni Chinese porcupine 10 0 0 
Macaca mulatta Rhesus macaque 6 0 0 
Melogale moschata Chinese ferret badger 8 0 0 
Muntiacus muntjak Indian muntjac 4 0 0 
Paguma larvata Himalayan palm civet 21 0 0 
Rattus rattus Black rat 6 0 0 
Viverricula indica Small indian civet 4 0 0 
Sus scrofa Wild boar 9 0 0 
Reptiles 
Naja atra Chinese cobra 1 0 0 
Opisthotropis balteata Banded stream snake 1 0 0 
Ptyas korros Indo-Chinese rat snake 2 0 0 
Trachemys scrpta elegants Red-eared slider 1 0 0 
Rhabdophis subminiatus helleri Red-necked keelback 1 0 0 
Trimeresurus albolabris Bamboo snake 1 0 0 
Sibynophis chinensis Chinese mountain snake 1 0 0 
Sinonatrix percarinata Mountain water snake 1 0 0 
Elaphe radiata Copperhead racer 1 0 0 
Birds 
Chalcophaps indica Emerald dove 1 0 0 
Garrulax pectoralis Greater necklaced laughing thrush 1 0 0 
Garrulax perspicillatus Masked laughing thrush 2 0 0 
Myophonus caeruleus Blue whistling thrush 3 0 0 
Scolopax rusticola Eurasian woodcock 1 0 0 
Streptopelia chinensis Spotted dove 3 0 0 
Streptopelia orientalis Oriental turtle dove 3 0 0 
Turdus cardis Japanese thrush 1 0 0 
Turdus obscurus Eyebrowed thrush 1 0 0 
Turdus ohortulorum Grey-backed thrush 1 0 0 
Zoothera dauma Scaly thrush 1 0 0 


drawn from captured Himalayan palm civets for neutralization assays for 
SARS-CoV (8). The neutralization assays were performed as described pre- 
viously (8). 

RNA extraction and reverse transcription. RNA from 140 wl of the sample 
was extracted by QIAamp virus RNA mini kit (QIAGEN) following the manu- 
facturer’s instructions. Extracted RNA was eluted in 50 pl of RNase-free water 
and stored at —20°C. cDNA was generated as described previously (19). Briefly, 
10 yl of eluted RNA samples was reverse transcribed by 200 U of Superscript II 
reverse transcriptase (Invitrogen) in a 20-1 reaction mixture containing 0.15 pg 
of random hexamers, 10 mmol of dithiothreitol per liter, and 0.5 mmol of 


deoxynucleoside triphosphate per liter. Reaction mixtures were incubated at 
42°C for 50 min, followed by a heat inactivation step (72°C for 15 min). Reverse- 
transcribed products were stored at —20°C. 

PCR and sequencing. A pair of consensus primers targeted to the conserved 
region of coronavirus RNA polymerase (RNA-dependent RNA polymerase 
[RdRp]) sequences was used to screen the RNA samples (PCR 2 in Table 2). In 
a typical PCR, 2 wl of CDNA was amplified in a 50-l reaction mixture containing 
0.2 mmol of deoxynucleoside triphosphates per liter, 3 mmol of MgCl, per liter, 
0.5 pmol of forward primer per liter, 0.5 pmol of reverse primer per liter, and 
0.25 U of AmpliTaq Gold (Applied Biosystems). PCR was performed as follows: 


FIG. 1. (A) Protein sequence alignment of coronavirus RdRps. The conserved motifs for RdRps are indicated above the sequences. Specimens 
collected from M. pusillus at geographical site 1 (*), M. pusillus at geographical site 2 (1), and M. schreibersii (specimen 86) and M. magnater 
(specimens 88 and 96) at site 3 (+) are indicated. 229e, HCoV-229E. (B) Phylogenetic analysis of RNA sequences encoding RdRp (partial 


sequence). 
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TABLE 2. Primer sequences positive for Bat-CoV detection 


PCR Target ates Primer sequence (5’ to 3’) 
sequence orientation 
1 RdRp Forward AYAACCAAGATCTTAATGG 
Reverse TGCTTAGAACCCAAAATCAT 
2° RdRp Forward GGTTGGGACTATCCTAAGTGTGA 


Reverse CCATCATCAGATAGAATCATCATA 

3 Helicase-ExoN Forward CTCARGGTAGTGARTATGA 
Reverse AATTGTTCWCCWGGTGG 

4 Spike Forward WTATGTTTGYAATGGTAAY 
Reverse GTCWTCATCMACWGTRC 

5 Spike Forward GAYTDDCAGCACTTAATGC 
Reverse TTGAGCCAYTCAAGRTYRA 

6 Spike Forward CAATCTAGGTCTGCTATCG 
Reverse CTAGAAGACTGTGATTTGA 


* Forward and reverse primers (IN-6 and IN-7) were communicated through 
the World Health Organization’s SARS etiology network by colleagues from the 
Centers for Disease Control and Prevention. 


(i) 10 min at 95°C; (ii) 40 cycles, with 1 cycle consisting of 1 min at 95°C, 1 min 
at 58°C, and 1 min at 72°C. Amplified DNA products were analyzed by agarose 
gel electrophoresis and DNA sequencing. To avoid possible contamination, 
RNA extraction, reverse transcription-PCR (RT-PCR), and gel electrophoresis 
were preformed in separate laboratories. In addition, water controls were in- 
cluded in each run of the RT-PCR assay, and no false-positive result was ob- 
served in the negative-control reactions. 

In subsequent experiments, primers targeted to the RdRp, helicase-ExoN, and 
S-encoding sequences were used to determine the Bat-CoV sequence (Table 2, 
PCRs 1 and 3 to 6). PCR conditions of these assays were identical to the PCR 
assay described above. DNA products with expected sizes were purified by 
QIAquick PCR purification kit (QIAGEN) and were cloned into DNA vectors 
(pCR-TOPO; Invitrogen). DNA inserts in purified plasmid were sequenced by 
BigDye Terminator v3.1 Cycle Sequencing kit (Applied Biosystems). Sequencing 
products were analyzed by ABI PRISM 3700 DNA analyzer (Applied Biosys- 
tems). Both sense and antisense sequences of these PCR products were se- 
quenced at least once. 

Data analysis. Deduced viral sequences were analyzed and aligned by BioEdit, 
version 5.0.9 (http://www.mbio.ncsu.edu/BioEdit/bioedit.html). Phylogenetic 
trees were constructed by the neighbor-joining method, and bootstrap values 
were determined by 1,000 replicates in MEGA 2.1 (http://www.megasoftware 
net). Potential glycosylation sites of the S protein were predicted by NetNGlyc 
1.0 Server (http://vww.cbs.dtu.dk/services/NetNGlyc/). Stablecoil 1.0 (http:/Avww 
.pence.ca/stablecoil/) was used to detect the heptad repeat (HR) regions of the 
S protein. Reference sequences used in the study are FIPV (GenBank accession 
number AB088222), TGEV (NC002306), HCoV-229E (AF304460), HCoV-NL63 
(AY567487), HCoV-OC43 (NC005147), CCoV (AY342160), PEDV (AF353511), 
IBV (NC005147), MHV (NC001846), and SARS-CoV (NC004718). 

Nucleotide sequence accession numbers. The deduced sequences from this 
study were deposited in GenBank under accession numbers AY864196 to 
AY864198. 


RESULTS 


Surveillance. A total of 162 swab samples from 12 bat spe- 
cies and 176 swab samples from 32 other animal species (Table 
1) were collected. To identify coronaviruses from these speci- 
mens, a pair of consensus primers that can cross-react with a 
number of coronavirus RdRp sequences were used to screen 
the field samples (Table 2, PCR 2). Positive PCR amplicons 
were detected in three different bat species (Miniopterus spp. 
[Table 1]). In particular, 12 of 19 (63%) fecal swab samples 
from Miniopterus pusillus were positive in the screening test. 
These specimens were collected from three different geograph- 
ical locations (Fig. 1A). 

Identification of a Bat-CoV. To characterize the RdRp se- 
quence deduced from PCR 2 (Table 2), amplicons from each 
animal were subjected to DNA sequencing. All the sequences 
generated from M. pusillus were highly similar (Fig. 1A). Al- 
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though there are sequence polymorphisms among these se- 
quences, these sequences form a distinct branch within the 
clade of group 1 coronavirus RdRp sequences (data not shown). 
These results suggested that the coronavirus circulating in 
M. pusillus is a novel virus. More interestingly, viral sequences 
deduced from Miniopterus magnater (Fig. 1A, samples 88 and 
96) and Miniopterus schreibersii (Fig. 1A, sample 86) were also 
highly similar to those sequences from M. pusillus (Fig. 1A), 
indicating that viruses isolated from these three bat species are 
the same virus or are of the same lineage. 

Attempts to isolate this virus in cell cultures were made. 
Fecal and respiratory samples were used to infect Madin- 
Darby canine kidney (MDCK), fetal rhesus kidney (FRhK4) 
and African green monkey kidney (Vero E6) cells. However, 
no evidence of virus replication was detected in these cells by 
cytopathic effect or by RT-PCR. 

Characterization of Bat-CoV RdRp, helicase-ExoN, and S 
sequences. A representative sample from M. pusillus (spec- 
imen 61 [Fig. 1A]) was selected for further sequence analysis. 
As mentioned above, our preliminary data indicated that this 
novel virus is a group 1 coronavirus. On the basis of the con- 
served regions of group 1 viruses (HCoV-229E, HCoV-NL63, 
PEDV, TGEV, FIPV, and CCoV), 30 sets of primers were 
generated for the determination of the viral sequences. Of 
these PCRs, five additional viral RNA sequences were de- 
duced (Table 2, PCRs 1 and 3 to 6). Alignments of these 
deduced sequences generated three RNA fragments contain- 
ing partial sequences for the (i) RdRp gene, (ii) genes C 
terminal of the helicase gene and N terminal of the hypothet- 
ical exonuclease (ExoN) gene, and (iii) S genes (Table 3). The 
percent sequence identity of each RNA fragment to group 1 to 
3 viruses was determined (Table 4). All these RNA sequences 
shared the highest identity with group 1 coronaviruses (54 to 
75%). Phylogenetic analyses of these viral sequences resulted 
in consensus trees with similar topologies (Fig. 1B, Fig. 3A, 
and Fig. 4A). In all cases, Bat-CoV clustered with group 1 
coronaviruses. In particular, the RdRp and S sequences were 
most closely related to those of PEDV. 

Sequence analysis of the partial sequence of the S gene 
reveal that this RNA sequence encodes the S2 protein subunit 
(2, 23). The S protein of coronavirus is known to be heavily 
glycosylated, and 11 potential N-glycosylation sites were iden- 
tified (Fig. 2). The deduced sequence contains the HR1 region 
and part of the sequence of the HR2 region (Fig. 3B and C). 
These HR regions were separated by an interhelical domain of 
~130 amino acid residues (Fig. 2). As with other group 1 
coronaviruses, Bat-CoV also has 14 “additional” amino acid 
residues in both HR regions (Fig. 3C) (2). 

The partial RdRp protein sequence of Bat-CoV contains 


TABLE 3. Information about Bat-CoV sequences 
deduced in this study 


Length of deduced protein 


RNA Length ‘ Encoding 
sequence (nt)! sequence (no. of amino tépion 
q acid residues) 8 
1 1,613 537 RdRp 
2 591 197 Helicase-ExoN 
3 1,448 482 S 


* nt, nucleotides. 
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TABLE 4. Nucleotide sequence identities of Bat-CoV RNA fragments to other coronaviruses 


% Identity of RNA fragment to: 


RNA fragment 


PEDV TGEV HCoV-229E HCoV-NL63 MHV HCoV-OC43 SARS-CoV IBV 
RdRp 74 70 75 74 60 61 62 63 
Helicase-ExoN a, 67 71 72 55 55 55 55 
S 58 54 58 60 40 42 41 43 


several conserved motifs of RdRps (motifs A to C, G, and F in 
Fig. 1A) (18, 25). Motif A has two conserved Asp residues 
separated by four residues and is known for metal ion binding 
and for recognition of the ribonucleoside triphosphate sugar 
ring. Motif B contains the highly conserved Ser, Gly, Thr, and 
Asn residues and is known to be involved in selection of the 
correct ribonucleoside triphosphate substrate. Motif C con- 
tains the highly conserved SDD sequence and is associated 
with metal ion and 3’-primer terminus binding. Motif F con- 
tains several conserved positively charged basic residues and 
is associated with nucleoside triphosphate binding. This motif 
could be divided into three submotifs (Fl to F3) (25). Like 
other coronaviruses, the F motif of Bat-CoV lacks the F2 
submotif. The biological functions of the F2 submotif in other 
viruses are yet to be determined. The G motif had a conserved 
SXGXP sequence and is known to be involved in positioning 
of the 5’ template strand in other RdRps. 

The RNA fragment for the helicase-ExoN junction encodes 
the last 54 amino acid residues of the RNA helicase and the 
first 143 amino acid residues of the putative exonuclease (Fig. 
4B) (21). The partial sequence for the helicase protein con- 
tains the conserved helicase motifs V and VI (12). The partial 
sequence for the putative ExoN protein contains motif 1 of the 
DEDD exonuclease superfamily (21). At the junction of these 
two proteins, a conserved cleavage signal for 3CL proteinase 
was identified (LQS at positions P2 to P1’), which suggests that 
the junction may be cleaved by 3CL proteinase expressed by 
Bat-CoV. 


DISCUSSION 


Interspecies transmissions of animal viruses to humans are 
permanent threats to human health. The recent transmis- 
sions of SARS-CoV (8), West Nile virus (13), Nipah virus 
(17), and avian influenza virus (15) from animals to humans 
have highlighted the importance of surveillance of viruses 
in wildlife. In order to obtain a better understanding of the 
prevalence of coronaviruses in this geographical region, we 
collected more than 300 animal samples from 44 animal 
species. Using a pair of consensus primers for RdRp of coro- 
naviruses, a novel group 1 virus was identified in three Min- 
iopterus spp. 

Of 12 bat species examined, the novel Bat-CoV was identi- 
fied in three different bat species from the same genus. Inter- 
estingly, Myotis chinensis and Myotis ricketti, which frequently 
cohabit with M. pusillus (K. Y. Suen, personal communication), 
were negative in this investigation. These results demonstrate 
that this virus has a narrow host range. As the viral sequences 
in Miniopterus spp. are highly similar, our data imply that there 
are frequent interspecies transmissions between these species. 
It is not certain which Miniopterus species is the natural host of 


the virus. However, the majority of M. pusillus bats were found 
to be infected by this virus. In addition, the detection rates of 
this virus in M. pusillus in the summer of 2003 and 2004 were 
similar (57% in 2003 and 67% in 2004). These observations 
suggest that M. pusillus is likely to be the major reservoir of this 
virus. As the majority of M. pusillus bats in Hong Kong are 
known to migrate to tropical areas to overwinter (K. Y. Suen, 
personal communication), we do not know the prevalence of 
Bat-CoV in this species in winter. Further work on the ecology 
and behavior of these three bat species in Hong Kong is re- 
quired to better understand the dynamic of this virus in these 
animals. 

Although all of the infected bats were shown to be healthy 
upon physical examination, it is not known whether this novel 
virus is pathogenic in bats. Both fecal and respiratory samples 
were positive for the virus. However, more than 50% of fecal 
samples from M. pusillus contained this novel virus compared 
to 26% of respiratory specimens from the same bats, imply- 
ing that this virus may have a predominantly enteric tropism. 
Further work is required to elucidate the persistence of the 
infection, tissue tropism, and possible pathogenicity of this 
virus. 

The deduced Bat-CoV sequences have the typical features 
of coronaviruses. The virus has the highest sequence identity to 
group 1 coronaviruses but is clearly distinct from previously 
known group 1 viruses. Our phylogenic analyses of these viral 
sequences also suggest that the virus is a group 1 virus. These 
findings are further supported by the fact that the S protein of 
Bat-CoV contains the unique signature of group 1 coronavi- 
ruses (i.e., the unique 14 amino acids in HR1 and HR2) (2). 
Recombination is common in coronaviruses and is thought to 
contribute to the emergence of new coronaviruses (9). Our 
limited sequencing results do not allow us to draw any conclu- 
sions of the origin of this virus. However, the low sequence 
homology between Bat-CoV sequences and other coronavi- 
rus sequences at least suggest that this novel virus is not a 
recent recombinant from existing coronaviruses. We are cur- 
rently attempting to sequence the rest of the viral genome 
for a full sequence characterization of this virus. However, 
these efforts are hampered by the inability to culture the 
virus in vitro. 

Apart from the Bat-CoV, we did not identify other corona- 
viruses in our samples. However, one should note that our 
results could not reveal a complete picture of the prevalence of 
coronaviruses in this geographical region. First, our test relies 
on the detection of viral sequence. Animals with a past coro- 
navirus infection would be negative in our assay. Second, due 
to the limited sample sizes of each animal species, we might 
miss the viruses which are circulating in low frequency. Be- 
sides, the conserved primers used in this study were based on 
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BAT CoV 1 ~+--DTFDLTLALPR- ------------------- “OR 65 

PEDV 1 GD- - -GYNFTNVLGASV- - ----------- YDPASGRV 74 

TEV 1 ~S-~--SETEDPIYKEWPNIGGSWLEGLKYILPSHNEKR 86 

NL63 1 RNIRSERI 70 

2295 1 LPTSGSRV 70 

FIPV 1 ATAPGGEKLGGLYFDG------------- LSSLLPERV 77 

Canine Cov 1 -§---TETLDPLYKEWPNIGGSWLGGLKDILPSHNEKER 86 

MHV 1 HL DNMQLOVASALMOGVTESSRLPDG- ~ -ISGFIDDINFSPLLG CIGSTCAED 76 

0c43 1 SONRINA IMATE VNEWLDTTOLOVANSLMNGVIWSTKLKDG- CLGSECSRA 76 

SARS-CoV GF @TOLNRAMSG 1BAEODRNTREVFAQVKOMYKTPTLKY§G- - --------GFNFS-------------- QULPDPIKP 66 

IBV 1 P Be venavecriondarycorecncanieion Senna LLLTNPS&RR 70 

BAT Cov 66 YI \ 144 
PEDV 75 sy 1 WAH Bog: SAS 152 
TGV 87 SA oe ah cage VLE ; tc: 164 
NL63 71 ASA vy c DLEICAG paver V MA I 147 
2298 71 J cea eaten teed V {AMG ; 147 
FIPV 78 SANG \ RES SRY KRICEING : CAQMYNGIBIVL FSV] AMS ] 154 
Canine CoV 387 cfu 164 
MHV 77 159 
0c43 77 152 
SARS-CoV 67 146 
IBV 71 147° 
BAT Cov 145 234 
PEDV 153 242 
TGV 165 254 
NL63 148 237 
229E 148 237 
FIPV 155 244 
Canine Cov 165 254 
MHV 160 235 
0c43 153 228 
SARS-CoV 147 222 
IBV 148 223 
BAT Cov 235 2 ALNE SOMLRY Sie LEW SK c 1 ENG 323 
PEDV 243 fe ache s ) 332 
TGV 255 Ma TSgs 41 Y ARLD Ee eTETR PAKDES mS SLANAING 343 
NL63 238 Cie bee eee 2 IND 326 
2295 PEN: BArSSS18 WREsvs' ‘ 2 ‘ 326 
FIPV 245 vi ; NECVKSH 333 
Canine Cov 255 / ; y is sae aL 1 343 
MHV 236 5; ic Ny 324 
0c43 229 1 317 
SARS-CoV 223 ) 311 
IBV 224 § | 312 
BAT Cov 324 405 
PEDV 333 419 
TGV 344 428 
NL63 327 407 
2295 327 407 
FIPV 334 414 
Canine CoV 344. q 428 
MHV 325 Ye ; DKNSWIMSSWAUNYTKAPE 403 
0c43 318 YeL Y is at 396 
SARS-CoV 312_-He ‘ 389 
IBV 313 397 
BAT Cov 406 NEMPELUS-pyip KEFAATI - PNRTEM-- TLNTY Mee DISVNSLTLOASELTKIAS 

PEDV 420 bg PDVElg-DYIp is EILASL-PNRTGS- -S| pLovelise ia (i ADLEQRSESLRNTTE 

TGV 429 PSI FW -DYIBINOTVODILENFRPNWTVg- - ERTFDI NTMI gL UDLEFRSEKLHNTTV 

NL63 408 ee TV -DY VIVINK THOR FAONL - PKYVHg- - NFDLTPE UKQLEAKTASLFOTTV: 

2295 408 SEMOTIVE-EYID LS YKL- PNYTVpg-- EOVMNON INET SBISTLENKSAELNYTVO 

FIPV 415 TrEOER DYIMINKTT ADMLEQYYSNYTTEELDMOLET FeAl DOLEQRADNLTNIAH 


0Cc43 397 TSU -NLPRFKEE WEKNQ- -TSV: 
SARS-CoV 390 TVYDPLOW -ELDSFKEEMDKYFKNHTSPDVDLG- - -DISGINAS MEO. SiD-- 


Canine Cov 429 PSE id - DY TW TNOTVODILENFRPNWTVg- - EMPLDIE RAM YIBMET CT NDLEFRSEKLHNTTV 
MHV 404 : TSify-NPPRFKEEMDEWFKNO - - TSI pDLSPOFE i | Teas 
IBV 398 TVETTFVDNUDFEFRDEMSKWWNDT - -KHEL- -DFDKFNYTVPIMDIDSBIb-------------- iT 


FIG. 2. Protein sequence alignment of coronavirus S proteins (partial sequence). The HR1 and HR2 regions are indicated. The locations of 
potential N-glycosylation sites in the Bat-CoV sequence are marked by asterisks. TGV, TGEV; NL63, HCoV-N63; 229E, HCoV-229E; OC43, 
HCoV-OC43. 
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Residue position 


HR1 sequence: 


da ada ad ad ad a ad a a 
LAVQSRLNYLALOTDVLQRNQOKILAQSFNAAMGNITVAFSGVSNAIQOQTV 
a da ad a ad ad ad a dad a 
SSLTTISRALNKVENVVNEQGTALSQLTKQLASNFQAISSSIEDLYNRLD 
d ad ada ad ad a dada a ada 

RLEADQQVDRLITGRLAALNAFVTQQLLRYSEVRASROLAQEKINECVK 


Partial HR2 sequence: 
a dadadadqeadqeadqeadadadaia 
NRTEMQLTLNTYNATVLNLTDEVNSLTLQASELTKIASELNLTISKINNTLV 


FIG. 3. (A) Phylogenetic analysis of RNA sequences for the S gene (partial sequence). (B) Predicted coiled-coil regions in the deduced 
S-protein sequence. The coiled-coil regions were predicted by Stablecoil 1.0 with a 35-residue window width. The HR1 and HR2 regions are 
indicated. (C) HR1 and HR2 in the S protein of Bat-CoV. The a and d positions of the strongest predicted coiled-coil heptad repeats are indicated. 
The 14-amino-acid residue insertions that are unique in group 1| viruses are underlined. 


available coronavirus sequences, and these primers might not 
be able to detect coronaviruses that are genetically more di- 
vergent from previously known coronaviruses. 

We previously reported that SARS-CoV could be isolated 
from Himalayan palm civets (8). It should be noted that the 
SARS-CoV-positive animals in our previous study were ob- 
tained from wild animal markets and not captured in the wild. 
None of the civets examined in our current study (7 = 21) was 
positive for SARS-CoV by both serological and molecular 
tests. The results from our present study do not exclude the 
possibility that the civet is the natural host of SARS-CoV, but 
our results at least indicate that SARS-CoV is not broadly 


circulating in wild civets. Further investigations are required to 
elucidate the natural reservoir of SARS-CoV, especially in 
mainland China. 

In conclusion, we reported that a novel coronavirus was 
identified from bats. The virus has the highest homology to 
group | coronaviruses. Bats are the reservoir for lyssaviruses 
and henipaviruses and are responsible for emerging diseases in 
humans. It is not known whether this virus would cause zoo- 
notic disease in humans or other animals. Further investi- 
gations are needed to understand the ecology and pathoge- 
nicity of this virus. In addition, this study also highlighted 
our poor understanding of viruses in wild animals. Given the 
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A) 98 HCoV-229E 

HCoV-NL63 

Bat-CoV Group 1 
PEDV 

TGEV 


IBV Group 3 


SARS-CoV 
HCoV-0C43 Group 2 
MHV 


59 


B) Helicase Helicase 
Motif V Motif VI 
—> > 


Bat-Cov 
PEDV 

TGEV 
HCovV-229E 
HCoV-NL63 
MEV 
HCov-0C43 
SARS-CoV 
IBV 


PRE RPRPH RP Re 


Bat-Cov 52 
PEDV 52 
TGEV 54 
HCoV-229E 52 
HCoV-NL63 52 
MEV 57 
HCoV-0C43 60 
SARS-CoV 57 
IBV 54 


Bat-Cov 109 Fe i 4 y e© 168 
PEDV 109 ie NY PLOWGES 168 
TGEV 111 euGTN pLagorssic 170 
HCoV-229E 109 Mi GI omigeLogorsic 168 

168 


HCoV-NL63 109 Ts A ICINL Ie WP LOWGF Si 
MHV 113 SEK 
HCoV-0C43 116 
SARS-CoV 114 
IBV 110 


y 172 
175 
fe 173 
y 169 


Bat-Cov 169 
PEDV 169 
TGEV 171 
HCoV-229E 169 

HCoV-NL63 169 
MHV 173 3 
HCov-0C43 176 
SARS-Cov 174 
IBV 170 


FIG. 4. (A) Phylogenetic analysis of RNA sequences coding for helicase-ExoN (partial sequence). (B) Protein sequence alignment of 
coronavirus helicase-ExoN. The conserved motifs for helicases and the first motif for ExoN (DEDD motif I) are indicated. The invariant acidic 
residues in DEDD motif 1 are labeled with white stars below the sequences. The inverted open triangle above the sequences marks the predicted 
3CL proteinase cleavage site. 
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catastrophic consequences of SARS, further surveillance work 
on viruses in wildlife should be encouraged. 
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